Pelican: A Building Block for Exascale Cold Data Storage
Abstract
A significant fraction of data stored in cloud storage is rarely accessed. This data is referred to as cold data, and storing it cost-effectively has become a challenge for cloud providers. Pelican is a rack-scale, hard-disk-based storage unit designed as the basic building block for exabyte-scale cold data storage. In Pelican, server, power, cooling and interconnect bandwidth resources are provisioned by design to support cold data workloads; this right-provisioning significantly reduces Pelican’s total cost of ownership compared to traditional disk-based storage. Resource right-provisioning in Pelican means that only 8% of the drives can be spinning concurrently. This introduces complex resource management, which is handled by the Pelican storage stack. Resource restrictions are expressed as constraints over the hard drives, and the data layout and IO scheduling ensure that these constraints are not violated. We evaluate the performance of a prototype Pelican and compare it against a traditional, resource-overprovisioned storage rack using a cross-validated simulator. We show that, compared to this overprovisioned storage rack, Pelican performs well for cold workloads, providing high throughput with acceptable latency.
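The idea of expressing a resource restriction as a constraint over the drives can be illustrated with a minimal sketch. It is not Pelican's actual storage stack: the class `SpinBudget`, its methods, and the 100-drive rack size are hypothetical names and values chosen for illustration, and only the 8% cap comes from the abstract.

```python
from typing import Set


class SpinBudget:
    """Hypothetical tracker that enforces a rack-wide cap on spinning drives."""

    def __init__(self, total_drives: int, max_percent: int = 8):
        # Largest whole number of drives that fits within the percentage cap.
        # Integer arithmetic avoids floating-point rounding surprises.
        self.limit = total_drives * max_percent // 100
        self.spinning: Set[int] = set()

    def can_spin_up(self, drives: Set[int]) -> bool:
        """Return True if spinning up `drives` keeps the rack within budget."""
        return len(self.spinning | drives) <= self.limit

    def spin_up(self, drives: Set[int]) -> bool:
        """Spin up `drives` if the constraint allows it; report success."""
        if not self.can_spin_up(drives):
            return False
        self.spinning |= drives
        return True

    def spin_down(self, drives: Set[int]) -> None:
        """Release budget by marking `drives` as no longer spinning."""
        self.spinning -= drives


# Assumed rack of 100 drives, so at most 8 may spin at once.
budget = SpinBudget(total_drives=100)
first = budget.spin_up({0, 1, 2, 3, 4, 5})   # admitted: 6 drives spinning
second = budget.spin_up({6, 7, 8})           # rejected: would need 9 drives
```

In this sketch a scheduler would call `can_spin_up` before dispatching a request, and queue the request until enough drives have spun down. A real system would also have to model the separate power, cooling and bandwidth constraints that the abstract mentions.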
Similar resources
Feeding the Pelican: Using Archival Hard Drives for Cold Storage Racks
Microsoft’s Pelican storage rack uses a new class of hard disk drive (HDD), known by vendors as archival class HDD. These HDDs are explicitly designed to store cooler and archival data, differing from existing HDDs by trading performance for cost. Our early Pelican experiences have helped some vendors define the particular characteristics of this class of drive. During the last twelve or so mon...
Performance Impacts with Reliable Parallel File Systems at Exascale Level
The introduction of Exascale storage into production systems will lead to an increase in the number of storage servers needed by parallel file systems. In this scenario, parallel file system designers should move from the current replication configurations to the more space- and energy-efficient erasure-coded configurations between storage servers. Unfortunately, the current trends on energy eff...
Optimization of thermal performance of external walls of residential building in cold and dry climate by Utilizing the Energy Simulation Software (Case Study: Mashhad, Iran)
The factors that can have a significant effect on the amount of solar energy received by the building are the material used in the external view and the lightning. The general objective of this research is to consider the existing climate conditions (Mashhad) in selecting and applying materials as well as the dimensions of the openings relative to the facade, taking into account the energy con...
Flamingo: Enabling Evolvable HDD-based Near-Line Storage
Cloud providers and companies running large-scale data centers offer near-line, cold, and archival data storage, which trade access latency and throughput performance for cost. These often require physical rack-scale storage designs, e.g. Facebook/Open Compute Project (OCP) Cold Storage or Pelican, which co-design the hardware, mechanics, power, cooling and software to minimize costs to support...
High-performance IO
Storage is becoming key in HPC systems, especially as Exascale systems arrive. The amount of data needed to solve the coming HPC challenges will not fit in memory, so storage systems need to keep pace with computing improvements; otherwise Exascale machines will waste energy waiting for the storage system to deliver the needed data. This research line investigates several path...